Unsupervised classification of uncertain data objects in spatial databases using computational geometry and indexing techniques

Author

  • Ramachandra Rao Kurada
Abstract

Unsupervised classification, also called clustering, is the process of organizing objects into groups whose members are similar in some way. Clustering uncertain data objects is a challenge in spatial databases. In this paper we represent uncertain data objects by Probability Density Functions (PDFs) and apply the Uncertain K-Means (UK-Means) algorithm to generate the clusters. This algorithm uses the Expected Distance (ED) to measure the distance between objects and cluster representatives. To improve the performance of UK-Means, we propose using Voronoi diagrams, a technique from computational geometry, to prune the number of ED computations. This technique works efficiently but introduces pruning overhead. To reduce that overhead we build an R*-tree index over the uncertain data objects, which lowers both the computational cost and the pruning overhead. Our approach of integrating UK-Means with Voronoi diagrams and an R*-tree over uncertain data objects produces impressive results when compared with existing methods.

Similar resources

An Approach for clustering uncertain data objects: A Survey

Recently, uncertain data objects have been used in various applications such as VANET environments, sensor applications, and image-processing-based systems. Clustering of uncertain data is a major concept in data mining, since more and more applications, such as sensor databases, location databases, and biometric information systems, produce vague and imprecise data. Clustering of uncertain data objects i...

Embedding advanced geometric techniques into SQL for efficient indexing of mobile objects

It is of great importance to try to embed new geometric techniques into SQL in order to achieve more efficient indexing of objects moving on the plane and to answer range queries about their future positions. This problem is motivated by real-life applications, such as allocating more bandwidth for areas where high concentration of mobile phones is imminent, or predicting future congestion areas ...

Extending the Qualitative Trajectory Calculus Based on the Concept of Accessibility of Moving Objects in the Paths

Qualitative spatial representation and reasoning are among the important capabilities in intelligent geospatial information system development. Although a large contribution to the study of moving objects has been attributed to the quantitative use and analysis of data, such calculations are ineffective when there is little inaccurate data on position and geometry or when explicitly explaining ...

Indexing Constraint Databases by Using a Dual Representation

Linear constraint databases are a powerful framework to model spatial and temporal data. The use of constraint databases should be supported by access data structures that make effective use of secondary storage and reduce query processing time. Such structures should be able to store both finite and infinite objects and perform both containment (ALL) and intersection (EXIST) queries. As standa...

Search Problems for Speech and Audio Sequences

The modern proliferation of very large audio and video databases has created a need for effective methods of indexing and searching highly variable or uncertain data. Classical search and indexing algorithms deal with clean input sequences. However, an index created from speech or music transcriptions is marked with errors and uncertainties stemming from the use of imperfect statistical models ...

Journal title:
  • CoRR

Volume abs/1312.2378  Issue

Pages  -

Publication date 2012